CDS

Accession Number TCMCG017C16164
gbkey CDS
Protein Id OMO91675.1
Location join(38152..38188,38289..38411,38581..38626,38958..39079,39241..39420,39518..39605,39891..40045,41378..41496,41675..41757,42181..42302,42510..42674,42804..42891,43348..43414,44466..44658,44782..44900,45012..45094,45512..45633,45934..46047,46160..46247,46359..46452,47419..47506,47611..47729,47879..47961,48143..48264,48731..48895,49298..49391,49551..49737)
Organism Corchorus olitorius
locus_tag COLO4_18191

Protein

Length 1021aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA215141, BioSample:SAMN03160584
db_source AWUE01016434.1
Definition hypothetical protein COLO4_18191 [Corchorus olitorius]
Locus_tag COLO4_18191

EGGNOG-MAPPER Annotation

COG_category I
Description Serine aminopeptidase, S33
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
KEGG_ko ko:K06889
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGCTCAGTCTGATGATCACTCCGCCCAAAATCCAGGAATTGAGCAGCAGAGGGTAATAATTCCAAACAAGCAAGGAGAAAAGCTTGTTGGAATATTACATGAAACTGGGTCTAAAGGAATTGTTGTCTTATGCCATGGTTTCAGAGCCACCAAGGTCGAGAAAGAAGGGATCAGCGCCTTTCGTTTCGATTTCGCCGGAATTGGAGAAAGTGAAGGTTCTTTTGAGTTTGGTAACACTCGGCGAGATGTTGATGATTTGCATACTGTCATCCAGCACTTTTGCGGCGCCAACCGCATAGTAACTGCAGTTCTTGGACATAGCAAAGGAGGCCTTGTGGTGCTTCTATATGCCTCTAAGTATCATGATATCAATACAGTCATCAATGTTTCTAGCCGCTATAATTTAAAGAAAGGAATGGAGGGCCCTTTCACAAAAAGCTTCTTAGGAAAAGACTTCATGGACAGAATTAAGAAGGATGGATTCGTTGATGTTAAGAATCAAACAGGAGAATTTCGTGTGACTTTGGAAAGTCTGATGGATCAGTTGAGTATAAATATGCATGAAGAATGTCTTAAGATACCTAGAGAATGCAGGGTGTTGACAGTCCATGGATCTGCTGATGAAATAACTCCGGTTGAAGAATCGTTCGAGTTTGATAAGACCATACCCAACCATAAACTACACATTGTAGAAGGAGCCAATCATGTCTACACTTCACATAAAACTGAATTAGCATCGGTTGTTTTGAGAATTGAGCAGCAGAGGGTAATAATTCCTAACAAACATGGAGAAAAGCTTGTTGGGTTATTACATGAAACTGGGTCTAAAGGAATTGTTGTCTTATGTCATGGTGTCGGAGCCACTAAGGATCATCCCATCATGGTGAACCTTGCTGTTGCTTTAGAGAAAGAAGGGATCAGTGCCTTTCGTTTCGACTTCGCTGGAATTGGAGAAAGTGAAGGTTCTTCTGGGTTTGGTAACATTATCCAAGAAGTTGATGATTTGCATGCTGTCATCCAACACTTTTGCGGGGCAAACCGCAAAGTAACTGCGATTCTTGGACATAGCAAAGGAGGCCTTGTGGTGCTTCTATATGCCTCTAAGTATCATGATATCCATACAGTTATAAATGTTTGTAGCCGCTATGATTTCAAGAAAGGACTTGAGGGCCCCTTCGGTAAAGACTTCATGGACAGATTTAAGAAGGATGGATTCATTGATGTTAAGAATCCAACAGGAGAATATCGTGTGACTTTGGAAAGTGTGATGGATCTCTTAAGTATCAATATGCATGAAGAATGTCTTAAGATACCTAGAGAATGCAGGGTATTGACAATCCACGGATCTGCCGATGAAATAACACCTGTTGAAGATGCGTTTGAGTTTGCCAAGCTGCTGGACTGTAAGAAGAATAAAGAAATGGAAATTCTTGCTTGTAAATTTCAGCCATTTATCTTAAATTTATCAAATTATAAGGCTCGTTCTAATTTTCTTAAATTTCCTCATCCCCAAATTCACTATAACTCATCTGCAACAACCTTGACCTTGAGGATGGCTCACTCTGATCACTCCGCCCAAAATCCAGGAATTGAGCAGCAGAGAGTTATAATTCCAAACAAGCATGGAGAAAAGCTTGTAGGGTTATTACATGAAACTGGGTCTAAAGAAATTGTTGTCTTATGCCATGGTTTTAGATCCACCAAGGCTGATCAAATCATGGTGAACCTTGCTGTTGCTTTAGAGAAAGAAGGGATTAGCGCCTTTCGTTTCGACTTTGCTGGAAATGGAGAAAGTGAAGGCTCTTTTGAGTTTGGTAACTATCTCCGAGAAGCTGATGATTTGCATGCTGTCATCCAGCACTTTTGCGGGGCAAACCGCACAGTAAGCACAATTCTTGGACATAGTAAAGTTATAAATGCTTCTGGCCGCTATGATTTAAAGAAAGGAATTGAGGAGCGCTTCGGAAAAGACTTCATGGACAGAATTAAGCAGGATGG
ATTCCTTGATTTTAAGGATAAAAAAGGAGAATATCGTGTGACTTTGGAAAGCCTGATGGATCGCTTAAGTATAAATATGCATGAAGAATGTCTTAAGATTCCTAAAGAATGCAGGGTGTTGACAGTCCATGGATCTGCCGATGAAATCATTCCTGTTGAAGATGCGTTCGAGTTCGCCAAGATCATACCCAACAATGAACTACACATCGTTCGTTTCATCTTATCCGCCAATCGCATCTCCGTTCCCAACTTAACAACTTTGAGGATGGCTCACTCTCTATCTGACCAAAACCCAGTGAATGAGCAGCAGAGAGTGATAATCCCAAATAAGCATGGAGAAAAACTTGTGGGATTATTGCATGAACATGGGTCTAAAGAGATTGTAGTGTTATGTCATGGTTTCAGATCAAGCAAGGACTACACTACAATGAGGACCCTTGTTGCTGCTTTTGAGAAAGAAGGAATCAGTGTCTTCCGGTTTGACTTTGCTGGAAATGGAGAGAGTGAAGGTTCATTTCAGTATGGTAACTATTACCGAGAGGCTGATGATTTGCATGCTGTGATTCAACACCTTTCTGGGGAAAATCGTGTGGTGATTGCGATTCTTGGGCATAGTAAAGGAGGAAATGTGGTGCTTCTCTATGCTTCTAAGTATCAAGATATCCCTATGGTTGTCAACGTTTCTGGCCGCTATGATTTGAAAAGAGGCATTGCAGAACGCTTGGGAGAAGACTTTATGGAAAAAATTAAGAAGGATGGATATATTGATGTTAAGAATAAGCAAGGAGATGTTGAATACCGTGTGACCGAGGAAAGTTTGATGGATCGCTTAGGAACTGATATGCATGAAGCATGCCTTAAGATTGATAAAGACTGTCGGTTGTTGACAGTCCATGGATCTGCTGATGTGATAGTTCCGGTTGAAGATGCATCGTCGTTTGCCAAGATTATACCTAATCACCAGTTACACATTTTGAAAAGAGCCAATCATGGATACACCTTACACCAAACGAAGTTGGCATCAGTTGTTCTGAACTTCATAAAAGACGGTCTTAGCGCCACATAG
Protein:  
MAQSDDHSAQNPGIEQQRVIIPNKQGEKLVGILHETGSKGIVVLCHGFRATKVEKEGISAFRFDFAGIGESEGSFEFGNTRRDVDDLHTVIQHFCGANRIVTAVLGHSKGGLVVLLYASKYHDINTVINVSSRYNLKKGMEGPFTKSFLGKDFMDRIKKDGFVDVKNQTGEFRVTLESLMDQLSINMHEECLKIPRECRVLTVHGSADEITPVEESFEFDKTIPNHKLHIVEGANHVYTSHKTELASVVLRIEQQRVIIPNKHGEKLVGLLHETGSKGIVVLCHGVGATKDHPIMVNLAVALEKEGISAFRFDFAGIGESEGSSGFGNIIQEVDDLHAVIQHFCGANRKVTAILGHSKGGLVVLLYASKYHDIHTVINVCSRYDFKKGLEGPFGKDFMDRFKKDGFIDVKNPTGEYRVTLESVMDLLSINMHEECLKIPRECRVLTIHGSADEITPVEDAFEFAKLLDCKKNKEMEILACKFQPFILNLSNYKARSNFLKFPHPQIHYNSSATTLTLRMAHSDHSAQNPGIEQQRVIIPNKHGEKLVGLLHETGSKEIVVLCHGFRSTKADQIMVNLAVALEKEGISAFRFDFAGNGESEGSFEFGNYLREADDLHAVIQHFCGANRTVSTILGHSKVINASGRYDLKKGIEERFGKDFMDRIKQDGFLDFKDKKGEYRVTLESLMDRLSINMHEECLKIPKECRVLTVHGSADEIIPVEDAFEFAKIIPNNELHIVRFILSANRISVPNLTTLRMAHSLSDQNPVNEQQRVIIPNKHGEKLVGLLHEHGSKEIVVLCHGFRSSKDYTTMRTLVAAFEKEGISVFRFDFAGNGESEGSFQYGNYYREADDLHAVIQHLSGENRVVIAILGHSKGGNVVLLYASKYQDIPMVVNVSGRYDLKRGIAERLGEDFMEKIKKDGYIDVKNKQGDVEYRVTEESLMDRLGTDMHEACLKIDKDCRLLTVHGSADVIVPVEDASSFAKIIPNHQLHILKRANHGYTLHQTKLASVVLNFIKDGLSAT